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Office Action of April 28, 2008 for record purposes. 

DETAILED ACTION 

1 . The Amendment filed January 1 1 , 2008 in response to the Office Action of October 1 1 , 
2007 is acknowledged and has been entered. Previously pending claims 6, 7 and 17 have been 
cancelled, claims 4, 5, and 8-10 have been amended 

2. Claims 4, 5, and 8-10 are currently being examined. 

3 . The following rejections are being maintained: 

Claim Rejections - 35 USC § 112 

The following is a quotation of the first paragraph of 35 U.S.C. 112: 

The specification shall contain a written description of the invention, and of the manner and process of making 
and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and use the same and shall set forth the best mode 
contemplated by the inventor of carrying out his invention. 

4. Claims 4, 8, and 9 remain rejected under 35 USC 1 12, first paragraph for the reasons 
previously set forth in the Office Action of October 11, 2007, section 12-pages 13-19. 

Applicants argue that with respect to the Examiner's assertion that the nucleic acid set 
forth in SEQ ID NO: 3 can code for a protein or polypeptide that is present in the nucleus of the 
animal cell, the nucleic acid set forth in SEQ ID NO: 3 can be present both in the nucleus and the 
cytoplasm. 

Applicants argue that in order to assist the Examiner, claim 4, upon which claim 8 also 
depends, has been amended to define the nucleic acid recited in the claims as the nucleic acid set 
forth in SEQ ID NO: 3 and a nucleic acid strand that is completely complementary to the nucleic 
acid set forth in SEQ ID NO: 3. As acknowledged by the Examiner in the Office Action, the 
present application discloses SEQ ID NO: 3. The present application discloses that the term 
"nucleic acid of the present invention" can include a complementary strand selected from 
information of the nucleic acid set forth in SEQ ID NO: 3 (page 22). As is also acknowledged by 
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the Examiner, it is known in the art that the phrase "complementary strands of nucleic acids" can 
include nucleic acids that are completely complementary to the claimed polynucleotide. 

Applicants arguments have been considered, but have not been found persuasive as the 
claims are still drawn to fragments of SEQ ID NO: 3. Claims 4, 7 , and 9 are drawn to "A 
recombinant vector comprising a purified nucleic acid coding for a (emphasis added) protein 
or polypeptide . . . wherein the nucleic acid is set forth in (emphasis added) SEQ ID NO: 3 or is 
a nucleic acid that is completely complementary to the nucleic set forth in (emphasis added) 
SSEQ ID NO: 3. . ." and a host cell transformed with the vector and a method of producing the 
protein. This claim construction clearly reads on fragments of SEQ ID NO: 3 as the claims are 
not drawn to the protein or polypeptide produced from a nucleic acid comprising SEQ ID NO: 3. 
The claims are not enabled to make a fragment of SEQ ID NO: 3 that is present in the nucleus 
and which has a transcription factor function and/or a function that can induce expression of 
retinoblastoma gene or a gene product thereof, given that the specification has not identified the 
regions of the encoded polypeptide that are required for these functions and given the 
unpredictability of protein biochemistry and predicting function from structure previously set 
forth. Thus undue experimentation would be required to identify fragments that encode a protein 
with the claimed functions. 

Applicant's arguments have not been found persuasive and the rejection is maintained. 
5. Claims 4, 8 and 9 remain rejected under 35 USC 1 12, first paragraph for the reasons 
previously set forth in the Office Action of October 11, 2007, section, 13, pages 19-24. 

Applicants argue that in order to assist the Examiner, claim 4, upon which claim 8 also 
depends, has been amended to define the nucleic acid recited in the claims as the nucleic acid set 
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forth in SEQ ID NO: 3 and a nucleic acid strand that is completely complementary to the nucleic 
acid set forth in SEQ ID NO: 3. As acknowledged by the Examiner in the Office Action, the 
present application discloses SEQ ID NO: 3. As is also acknowledged by the Examiner, it is 
known in the art that the phrase "complementary strands of nucleic acids" can include nucleic 
acids that are completely complementary to the claimed polynucleotide. 

Applicants arguments have been considered, but have not been found persuasive as the 
claims are still drawn to fragments of SEQ ID NO: 3. Claims 4, 8 , and 9 are drawn to "A 
recombinant vector comprising a purified nucleic acid coding for a (emphasis added) protein 
or polypeptide . . . wherein the nucleic acid is set forth in (emphasis added) SEQ ID NO: 3 or is 
a nucleic acid that is completely complementary to the nucleic set forth in (emphasis added) 
SSEQ ID NO: 3. . ." and a host cell transformed with the vector and a method of producing the 
protein. This claim construction clearly reads on fragments of SEQ ID NO: 3 as the claims are 
not drawn to the protein or polypeptide produced from a nucleic acid comprising SEQ ID NO: 3. 
Thus, the claims are not enabled to make a fragment of SEQ ID NO: 3 that is present in the 
nucleus and which has a transcription factor function and/or a function that can induce 
expression of retinoblastoma gene or a gene product thereof given that the specification has not 
identify the regions of the encoded polypeptide that are required for these functions and undue 
experimentation would be required to identify fragments that encode a protein with the claimed 
functions. The level of skill and knowledge in the art is such that one of ordinary skill would not 
be able to identify without further testing which of these fragment of SEQ ID NO: 3 can code for 
a protein have the ability to be present in the nucleus and which has a transcription factor 
function and/or a function that can induce expression of retinoblastoma gene or a gene product 
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thereof. Thus one of skill in the art would not recognize that Applicants were in possession of 
the claimed genus. 

Applicant's arguments have not been found persuasive and the rejection is maintained. 
Claim Rejections - 35 USC § 102 

The following is a quotation of the appropriate paragraphs of 35 U.S. C. 102 that form the 

basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(a) the invention was known or used by others in this cotintn , or patented or described in a printed publication in this 
or a foreign country, before the invention thereof by the applicant for a patent. 

6. Claims 8-10 remain rejected and claims 4 and 5 are rejected under 35 
U.S.C. 102(a) as being anticipated by Chano et al. (Oncogene, February 14, 2002, 21:1295-1298, 
IDS) for the reasons forth in the Office Action of October 11, 2007, section, 15, pages 25-27. 

Applicants argue that three of the authors listed in Chano et al., Chano, Ikegawa, and 
Okabe, are the also the inventors of the present application. The remaining three authors listed in 
Chano et al, Kontani, Baldini, and Saeki, were working under the direction of the present 
inventors and their contributions were not of an inventive nature. Applicants submit herewith a 
Declaration under 37 C.F.R. § 1.132 to further establish that the authors of Chano et al. are the 
inventors of the present application. As such, Applicants respectfully submit that Chano et al. 
does not qualify as an invention known or used by "others" within the meaning of 35 U.S.C. 
§102(a). 

The Declaration under 37 CFR 1.132 filed January 11, 2008 is insufficient to overcome 
the rejection of claims 4, 5 and 8-10 based upon Chano et al. (Oncogene, February 14, 2002, 
21 : 1295-1298, IDS) as set forth in the last Office action because: The Declarants state in 
section 1 : 
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We, Tokuhiro Chano, Shiro Ikegawa, and Hidetoshi Okabe, do declare and state as 
follows: We are three of the six named inventors (emphasis added) of the present 
application identified above. 

Given the statement that "We are three of the six named inventors" and the identity of the 
other three inventors has not been made known to the Office nor have six inventors signed the 
Declaration, the Declaration under 37 CFR 1.132 is not an unequivocal statement from the 
Applicant regarding the subject matter disclosed in the article and has not properly executed, see 
MPEP 7 1 6. 1 0 and CFR 1 .63 . Thus, the Declaration under 37 CFR 1.132 filed January 1 1 , 2008 
is insufficient to overcome the rejection. 

Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

7. Claim 10 remains rejected under 35 U.S.C. 103(a) as being unpatentable over AB059622 
(October 11, 2001), in view of Mensink et al (British J. Haematol. (August 1998) 102:768-774) 
and further in view of Buck et al (Bio techniques (1999) 27(3):528-536) for the reasons forth in 
the Office Action of October 11, 2007, section, 16, pages 28-30. 

Applicants argue that as acknowledged by the Examiner, AB059622 does not teach the 
particular primers of SEQ ID Nos: 19 and 20. Mensink et al. and Buck et al. also do not, alone or 
in combination, teach or suggest SEQ ID Nos: 19 and 20. With reference to the Examiner's 
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assertion that published sequences may be analyzed by commercially available software for 

primer selection in many cases, one can use the "Primer 3 website" (primer3.sourceforge.net) for 

this purpose rather than the commercially available software taught in Mensink et al. Simply by 

knowing the nucleotide sequence information, one can use the "Primer 3 website" to analyze 

primer design with general versatility. However, only after using the designed primer, can one 

obtain useful information on whether or not it is applicable to an experiment or clinical. Thus, 

one cannot determine if a nucleotide sequence is useful, simply because the sequence is known. 

Accordingly, one of ordinary skill in the art would not be able to arrive at the particular primers 

of SEQ ID NOs: 19 and 20, simply because of the disclosure of AB059622. 

Applicants arguments have been considered, but have not been found persuasive because 

of the availability in the art of primer design programs in the art at the time the invention was 

made and the teaching of Buck that every single primer tested of the 164 primers tested 

functioned as expected, one of skill in the art would have a reasonable expectation of success 

given that sequence was known in the art at the time the invention was made. 

Applicant's arguments have not been found persuasive and the rejection is maintained. 
New Grounds of Rejection 
Claim Rejections - 35 USC § 102 

The following is a quotation of the appropriate paragraphs of 35 U.S. C. 102 that form the 

basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public use or on 
sale in this country, more than one year prior to the date of application for patent in the United States. 

8. Claims 4, 5, and 8 are rejected under 35 U.S.C. 102(b) as being anticipated by 
Nagase et al. (DNA Research, 1996, 3:321-329) as evidenced by Nomura et al. (DNA Research, 
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1994: 1: 27-35), Chano et al. (Oncogene, February 14, 2002, 21:1295-1298, IDS) and Appendix 
1. 

The claims are drawn to: 

4. A recombinant vector comprising a purified nucleic acid coding for a protein or 
polypeptide which is present in nucleus of human or animal cell and which has a transcription 
factor function and/or a function that can induce expression of retinoblastoma gene (RB 1 gene) 
or a gene product thereof the polypeptide or protein according to claim 1, or a complementary 
strand thereof, wherein the nucleic acid is set forth in SEQ ID No: 3 or is a nucleic acid strand 
that is completely complementary to the nucleic acid set forth in SEQ ID No: 3. 

5. A recombinant vector comprising a nucleic acid hybridizing under stringent 
conditions with the a purified nucleic acid set forth in SEQ ID No: 3 or a nucleic acid strand that 
is completely complementary to the nucleic acid set forth in SEQ ID No: 3 according to claim 3 
or the complementary strand thereof; wherein the stringent conditions comprise a condition 
under which a positive hybridization signal is still observed even after heating at 42 °C in a 
solution of 6 x SSC, 0.5% SDS and 50% formamide, and washing at 68 °C in a solution of 0.1 x 
SSC and 0.5% SDS. 

8. A trans formant that was transformed with the recombinant vector according to claim 4. 

Nagase et al. teach the cloning of the cDNA KIAA0203, which 99.3% identical to SEQ 
ID NO: 3 and codes for a protein identical to RB1CC1, see Table 1 of Nagase et al. and 
Appendix 1 . Nagase et al. used the methods Nomura et al. for cloning the cDNA, see Materials 
and Methods, and reference 1 of Nagase et al. Nomura et al. teach that cDNA were cloned and 
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placed into the pBluescript SK+ cDNA vector and used to make cDNA libraries that were grown 
in colonies of cells, see p. 28, 1 st col, of Nomura et al. 

Chano et al. teach that RBI CO can induce the expression of the RBI gene, see Abstract, 
Fig. 2 and Fig.4. 

Although the reference does not specifically state that KIAA0203 codes for a protein or 
polypeptide which is present in nucleus of human or animal cell and which has a transcription 
factor function and /or a function that can induce expression of retinoblastoma gene (RBI gene) 
or a gene product thereof, given the teaching of Chano ct al. The claimed product appears to be 
the same as the prior art product, absent a showing of unobvious differences. The office does not 
have the facilities and resources to provide the factual evidence needed in order to establish that 
the product of the prior art does not possess the same material, structural and functional 
characteristics of the claimed product. In the absence of evidence to the contrary, the burden is 
on the applicant to prove that the claimed product is different from those taught by the prior art 
and to establish patentable differences. See In re Best 562F.2d 1252, 195 USPQ 430 (CCPA). 
Claim Rejections - 35 USC § 103 

The factual inquiries set forth in Graham v. John Deere Co., 383 U.S. 1, 148 USPQ 459 
(1966), that are applied for establishing a background for determining obviousness under 35 
U.S.C. 103(a) are summarized as follows: 

1 . Determining the scope and contents of the prior art. 

2. Ascertaining the differences between the prior art and the claims at issue. 

3. Resolving the level of ordinary skill in the pertinent art. 

4. Considering objective evidence present in the application indicating obviousness 
or nonobviousness. 
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This application currently names joint inventors. In considering patentability of the 
claims under 35 U.S.C. 103(a), the examiner presumes that the subject matter of the various 
claims was commonly owned at the time any inventions covered therein were made absent any 
evidence to the contrary. Applicant is advised of the obligation under 37 CFR 1 .56 to point out 
the inventor and invention dates of each claim that was not commonly owned at the time a later 
invention was made in order for the examiner to consider the applicability of 35 U.S.C. 103(c) 
and potential 35 U.S.C. 102(e), (f) or (g) prior art under 35 U.S.C. 103(a). 

9. Claims 4, 5, 8, and 9 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over AB059622 (October 11, 2001) as evidenced by Chano et al. (Oncogene, February 14, 2002, 
21:1295-1298, IDS), in view of US Patent No. 4,889,806 (Dec. 1989) and Sambrook et al 
(Molecular Cloning, A Laboratory Manual, Cold Spring Harbor, 1989, pp. 16.3-4). 

The claims are drawn to: 

4. A recombinant vector comprising a purified nucleic acid coding for a protein or 
polypeptide which is present in nucleus of human or animal cell and which has a transcription 
factor function and/or a function that can induce expression of retinoblastoma gene (RB 1 gene) 
or a gene product thereof the polypeptide or protein according to claim 1 , or a complementary 
strand thereof, wherein the nucleic acid is set forth in SEQ ID No: 3 or is a nucleic acid strand 
that is completely complementary to the nucleic acid set forth in SEQ ID No: 3. 

5. A recombinant vector comprising a nucleic acid hybridizing under stringent 
conditions with the a purified nucleic acid set forth in SEQ ID No: 3 or a nucleic acid strand that 
is completely complementary to the nucleic acid set forth in SEQ ID No: 3 according to claim 3 
or the complementary strand thereof; wherein the stringent conditions comprise a condition 
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under which a positive hybridization signal is still observed even after heating at 42 °C in a 
solution of 6 x SSC, 0.5% SDS and 50% formamide, and washing at 68 °C in a solution of 0.1 x 
SSC and 0.5% SDS. 

8. A trans formant that was transformed with the recombinant vector according to claim 4. 

9. A method for producing a protein or polypeptide which is present in the nucleus of a 
human or animal cell and which has a transcription factor function and/or a function that can 
induce expression of retinoblastoma gene (RB 1 gene) or a gene product thereof or a 
complementary strand thereof, wherein the nucleic acid is set forth in SEQ ID NO" 3 the 
polypeptide or protein according to claim 1 , comprising a step of culturing the transformant 
according to claim 8 with the recombinant vector containing nucleic acid coding for the 
polypeptide or protein. 

AB059622 teaches as previously set forth in the Office Action of October 11, 2007, 
section 14, pages 24-25, but does not teach a recombinant vector comprising SEQ ID NO: 3, a 
transformant transformed with the recombinant vector, or a method for producing protein using 
the recombinant vector. 

US Patent No. 4,889,806 teach that with the advent of recombinant DNA and molecular 
cloning technology it is now conventional to transfer genetic information into plasmids or 
vectors constructed in vitro and then transferred into host cells and clonally propagated (col. 1, 
lines 18-24). 

Sambrook et al teach that cloned genes are conventionally expressed using expression 
vectors and that expression of cloned proteins have been used to: (1) confirm the identity of a 
cloned gene by using immunological or functional assays to detect the encoded protein; (2) 
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produce large amounts of proteins of biological interest that are normally available in only 
limited quantities from natural sources; (3) to study the biosynthesis and intracellular transport of 
proteins following their expression in various cell types; and (4) to elucidate structure-function 
relationships by analyzing the properties of normal and mutant proteins (para bridging pages 
16.3 and 16.4). 

It would have been prima facie obvious to one of ordinary skill in the art at the time the 
invention was made to make a recombinant vector with the nucleic acid sequence of AB059622, 
transform the vector into a host cell and produce a protein with the methods of Sambrook et al 
and US Patent No. 4,889,806 because US Patent No. 4,889,806 specifically teaches that it is 
conventional to transfer genetic materials into plasmids or vectors and then transfer the plasmids 
or vectors into host cells and clonally propagate the genetic material and because Sambrook et al 
teach that cloned genes are conventionally expressed using expression vectors. 

One of ordinary skill in the art at the time the invention was made would have been 
motivated to make a recombinant vector with the nucleic acid sequence of AB059622 with the 
methods of Sambrook et al and US Patent No. 4,889,806 because Sambrook et al specifically 
teach that expressed cloned proteins are used to: (1) confirm the identity of a cloned gene by 
using immunological or functional assays to detect the encoded protein; (2) produce large 
amounts of proteins of biological interest that are normally available in only limited quantities 
from natural sources; (3) to study the biosynthesis and intracellular transport of proteins 
following their expression in various cell types; and (4) to elucidate structure-function 
relationships by analyzing the properties of normal and mutant proteins. Given the conventional 
nature of the methods, one of skill in the art would have had a reasonable expectation of success. 
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Priority 

10. Applicants state that at page 2, item 6, of the Office Action, the Examiner has 
acknowledged receipt of papers submitted under 35 U.S.C. § 1 19(a)-(d), which papers have been 
placed of record in the file. The Examiner recognizes a priority date of January 30, 2003. The 
Examiner indicates that because the priority of the instantly claimed invention is based on 
Japanese Application Nos. 2002-161400 and 2002-214978, and translations have not been 
provided, the Examiner is unable to recognize an earlier priority date. The Examiner suggests 
that Applicants submit a translation of the priority documents and to point to page and line where 
support can be found establishing an earlier priority date. 

Applicants argue that English translations arc not required for claiming priority. 
According to MPEP § 201.15, the actual merits of an applicant's claim of priority is considered 
by the Examiner only when a reference is found with an effective date between the date of the 
foreign filing and the date of filing in the United States. None of the publication dates of the 
references cited by the Examiner appears to fall within this range. As such, the priority dates of 
the Japanese applications should be recognized. 

Applicants' arguments have been considered and the conditions for foreign priority 
Japanese Application Nos. 2002-161400 and 2002-214978 have been met. 

1 1 . All other objections and rejections recited in Office Action of October 1 1 , 2007 are 
withdrawn. 

12. No claims allowed. 

13. Any inquiry concerning this communication or earlier communications from the 

examiner should be directed to Peter J. Reddig whose telephone number is (571) 272-903 1 . 
The examiner can normally be reached on M-F 8:30 a.m.-5:00 p.m. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Larry Helms can be reached on (571) 272-0832. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/Peter J Reddig/ 
Examiner, Art Unit 1642 
/P. J. RJ 

/Karen A Canella/ 

Primary Examiner, Art Unit 1643 



Appendix 1 

D86958 

LOCUS D86958 6614 bp mRNA linear PRI 15-JAN-2004 

DEFINITION Homo sapiens mRNA for KIAA0203 gene, partial cds . 

ACCESSION D86958 

VERSION D86958.1 GI:1503989 

KEYWORC 

SOURCE :!'."' er ( ' i 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
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Catarrhini; Hominidae; Homo. 

E 1 

AUTHORS Nagase,T., Seki,N., Ishikawa,K., Ohira,M., Kawarabayasi, Y . , 
Ohara,0., Tanaka,A., Kotani,H., Miyaj ima, N. and Nomura, N. 

TITLE Prediction of the coding sequences of unidentified human genes. VI. 

The coding seguences of 80 new genes (KIAA0201-KIAA0280) deduced by 
analysis of cDNA clones from cell line KG-1 and brain 
JOURNAL DNA Res. 3 (5), 321-329 (1996) 
PUBMED 9039502 
REFERENCE 2 (bases 1 to 6614) 

AUTHORS Ohara,0., Nagase,T., Kikuno,R. and Nomura, N. 
TITLE Direct Submission 

JOURNAL Submitted ( 02-AUG-l 996) Osamu Ohara, Kazusa DNA Research Institute; 
1532-3, Yana, Kisarazu, Chiba 292-0812, Japan 
(E-mail: cdnainfoL3kazusa.or.jp, Tel : +81-438-52-3 913) 

FEATURES Location/Qualifiers 

/organism="Homo sapiens" 
/mol_type = "mRNA" 
/db_xref="taxon: 9606" 
/chromosome="8" 

/cell_line="KG-l" 

/cell_type="myeloblast" 
gene <1. .6614 

/gene="KIAA0203" 
5'UTR 1. .515 

/gene="KIAA0203" 
CDS <510. .5291 

/gene="KIAA0203" 

/note="Start codon is not identified 
similar to mouse CC1." 

/codon_start=l 
/protein_id="BAA13194 .2" 
/db_xref="GI:4 0788 90 6" 

/translation="I IMKLYVFLVNTGTTLTFDTELTVQTVADLKHAIQSKYKIAIQH 
QVLWNGGECMAADRRVCTYSAGTDTNPIFLFNKEMILCDRPPAI PKTTFSTENDMEI 
KVEESLMMPAVFHTVASRTQLALEMYEVAKKLCSFCEGLVHDEHLQHQGWAAIMANLE 
DCSNSYQKLLFKFESIYSNYLQSIEDIKLKLTHLGTAVSVMAKIPLLECLTRHSYREC 
LGRLDSLPEHEDSEKAETKRSTELVLSPDMPRTTNESLLTSFPKSVEHVSPDTADAES 
GKEIRESCQSTVHQQDETTIDTKDGDLPFFNVSLLDWINVQDRPNDVESLVRKCFDSM 
SRLDPRIIRPFIAECRQTIAKLDNQNMKAIKGLEDRLYALDQMIASCGRLVNEQKELA 
QGFLANQKRAENLKDASVLPDLCLSHANQLMIMLQNHRKLLDIKQKCTTAKQELANNL 
HVRLKWCCFVMLHADQDGEKLQALLRLVIELLERVKIVEALSTVPQMYCLAWEWRR 
KMFIKHYREWAGALVKDGKRLYEAEKSKRESFGKLFRKSFLRNRLFRGLDSWPPSFCT 
QKPRKFDCELPDISLKDLQFLQSFCPSEVQPFLRVPLLCDFEPLHQHVLALHNLVKAA 
QSLDEMSQTITDLLSEQKASVSQTSPQSASSPRMESTAGITTTTSPRTPPPLTVQDPL 
CPAVCPLEELSPDSIDAHTFDFETIPHPNIEQTIHQVSLDLDSLAESPESDFMSAVNE 
FVIEENLSSPNPISDPQSPEMMVESLYSSVINAIDSRRMQDTNVCGKEDFGDHTSLNV 
QLERCRWAQDSHFSIQTIKEDLCHFRTFVQKEQCDFSNSLKCTAVEIRNI IEKVKCS 
LEITLKEKHQKELLSLKNEYEGKLDGLIKETEENENKIKKLKGELVCLEEVLQNKDNE 
FALVKHEKEAVICLQNEKDQKLLEMENIMHSQNCEIKELKQSREIVLEDLKKLHVEND 
EKLQLLRAELQSLEQSHLKELEDTLQVRHIQEFEKVMTDHRVSLEELKKENQQIINQI 
QESHAEI IQEKEKQLQELKLKVSDLSDTRCKLEVELALKEAETDEIKILLEESRAQQK 
ETLKSLLEQETENLRTEISKLNQKIQDNNENYQVGLAELRTLMTIEKDQCISELISRH 
EEESNILKAELNKVTSLHNQAFEIEKNLKEQIIELQSKLDSELSALERQKDEKITQQE 
EKYEAI IQNLEKDRQKLVSSQEQDREQLIQKLNCEKDEAIQTALKEFKLEREWEKEL 
LEKVKHLENQIAKSPAIDSTRGDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQ 
NVRTSLIAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRA 
RLLEEKKKLEEEVSKLRSSSFVPSPYVATAPELYGACAPELPGESDRSAVETADEGRV 
DSAMETSMMSVQENIHMLSEEKQRIMLLERTLQLKEEENKRLNQRLMSQSMSSVSSRH 
SEKIAIRDFQVGDLVLIILDERHDNYVLFTVSPTLYFLHSESLPALDLKPASGASRRP 
WVLGKVMEKEYCQAKKAQNRFKVPLGTKFYRVKAVSWNKKV" 
3'UTR 5292. .6614 

/gene="KIAA0203" 

ORIGIN 
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Query Match 
Best Local Similarity 
Matches 6609; Conser 



Score 6587; I 
Pred. No. 0; 
0; Mismatches 



Length 6614; 

Indels 9; Gaps 

AACAAACCAAGCCGCGGCGGTGTCCGCGGCCCTGCCGAGCCCTCGGCGTTGCCTCAGAAT E 
I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

AGCCCTCGGCGTTGCCTCAGAAT E 

ACAGGCCGCGGCCTTCTGTCCCCCGGCCCCA ] 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ccccca : : :t t;g/t : tga:a;;: ttitttiiiiittitcca 

130 GACCCAGAGCCGAGGGGCCTGCTCGC 3T " " TTGT TCGCCCGGACCCCTCCCTGCCTCCTA 1 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
. GACCCAGAGCCGAGGGGCCTGCTCGCGTCCTTGTCCGCCCGGACCCCTCCCTGCCTCCTA ] 

GAGTTCGGGGCCGCGGCGGGCGGGCGCCCGGGACGCCGGCGGTTGTGTCGGCTTAGCGGT 5 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GAGTTCGGGGCCGCGGCGGGCGGGCGCCCGGGACGCCGGCGGTTGTGTCGGCTTAGCGGT 5 

GCCGAATGGGCGGTTGGTAACCGCTGCCGAGGACTAGGCGGCGGCGGAAGATGGTGCCGG : 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GCCGAATGGGCGGTTGGTAACCGCTGCCGAGGACTAGGCGGCGGCGGAAGATGGTGCCGG ; 

GGGTCGCTGGCTCTGCTGCTGCCGCCGGCGAAGGAGGAGGCGTTGCCGGTTTTCTGAGTT ; 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GGGTCGCTGGCTCTGCTGCTGCCGCCGGCGAAGGAGGAGGCGTTGCCGGTTTTCTGAGTT ; 

TAACCAGTAATGCCATTCAGTTGCCAATCTCAAGCAAAGCAAACATAAGCCAGTTTTAAT < 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TAACCAGTAATGCCATTCAGTTGCCAATCTCAAGCAAAGCAAACATAAGCCAGTTTTAAT < 

430 CTACTTTTTAAGAAAAGTGGTAGTCCTTTTCACAGTGCCTGACGTAACTGTATCAGAGGG < 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

421 CTACTTTTTAAGAAAAGTGGTAGTCCTTTTCACAGTGCCTGACGTAACTGTATCAGAGGG < 

4 90 TGAGGTATAAGCTCACAGAATTCAGATAAATCATCATGAAGTTATATGTATTTCTGGTTA £ 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

. TGAGGTATAAGCTCACAGAATTCAGATAAATCATCATGAAGTTATATGTATTTCTGGTTA £ 

) AC AC T GGAAC T AC T C T AACAT T T GACAC T GAAC T T ACAGT GCAAAC T GT GGCAGACC T T A E 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

. ACACTGGAACTACTCTAACATTTGACACTGAACTTACAGTGCAAACTGTGGCAGACCTTA E 

610 AGCATGCCATTCAAAGCAAATACAAGATTGCTATTCAACACCAGGTGCTGGTGGTCAATG E 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGCATGCCATTCAAAGCAAATACAAGATTGCTATTCAACACCAGGTGCTGGTGGTCAATG < 

.GATCGAAGAGTGTGTACCTACAGTGCTGGGACGGATACAA " 
I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GAGGAGAATG1ATGGCTGCAGATCGAAGAGTGTGTACCTACAGTGCTGGGACGGATACAA " 

730 ATCCAATTTTTCTTTTTAACAAAGAAATGATCTTATGCGATCGTCCACCTGCTATTCCTA " 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
L ATCCAATTTTTCTTTTIAACAAAGAAATGATCTTATGTGATCGTCCACCTGCTATTCCTA - 

7 aaaatgacatggaaataaaagttgaagaatctct tat gat gc i 

/•/;•/• ; ;a:a?aaaa i;a: at; ;aaaiaaaa;~ ;aa;aai : i :"at.gaigc e 

tgttgcttcaaggacacagcttgcattggaaatgtatgaagttg 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

r: Tl ,, A',TTTTT ATA T -TT - TT A.-.--.-. A A- T TGCATTGGAAATGTATGAAGTTG i 

910 CCAAGAAA :TTTGT T : T T T T T ?T ?-AA ?- ?T : T T ?TA : AT ATT AAACATCTTCAACACCAAG i 
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Db 901 CCAAGAAACTTTGTTCTTTTTGTGAAGGTCTTGTACATGATGAACATCTTCAACACCAAG 960 

Qy 970 AAC C T GGAGGAC T GT T CAAAT T CAT ACCAAAAGC T AC T T T 1029 

Db 961 G' TGGG TG AAT ATGG ' ^ AAC C T GGAGGAC T GT T CAAAT T CAT ACCAAAAGC T AC T T T 1020 

Qy 1030 TCAAGTTTGAAAGTATTTATTCAAATTATCTGCAGTCCATAGAAGACATCAAGTTAAAAC 1089 

Db 1021 TCAAGTTTGAAAGTATT TAT T CAAAT TATCTGCAGTCCATAGAAGACATCAAGTTAAAAC 1080 

Qy 1 AGTTTCAGTAATGGCCAAGATTCCACTGTTGGAGTGCCTAA 1149 

Db 1081 T TACT CAT TTAGGAACTGCAGTTTCAGTAATGGCCAAGATTCCACTGTTGGAGTGCCTAA 1140 

ny II " > ^ A^AATGirr :z. TGA2 ATGAAGACT 1Z" j 

Db 1141 C C AGAC AT AG T T AC AGAGAAT GT T T GGGAAGAC T GGAT T C T T T ACC T GAACAT GAAGAC T 1200 

Oy 3AGACGAAAAGATCCACTGAACTGGTGCTCTCTCCTGATATGCCTAGAA 12 6 9 

Db 1201 CAGAAAAAGCTGAGACGAAAAGATCCACTGAACTGGTGCTCTCTCCTGATATGCCTAGAA 1260 

Qy 1270 CAACTAACGAATCTTTGTTAACCTCATTTCCCAAGTCAGTGGAACATGTGTCCCCAGATA 1329 

Db 1261 CAACTAACGAATCTTTGTTAACCTCATTTCCCAAGTCAGTGGAACATGTGTCCCCAGATA 1320 

Qy 133 0 CCGCAGATGCTGAAAGTGGCAAAGAAATTAGGGAATCTTGTCAAAGTACTGTTCATCAGC 13 8 9 

Db 1321 CCGCAGATGCTGAAAGTGGCAAAGAAATTAGGGAATCTTGTCAAAGTACTGTTCATCAGC 13 8 0 

Qy 1390 AAGATGAAACTACGATTGACACTAAAGATGGTGATCTGCCCTTTTTTAATGTCTCTTTGT 1449 

Db 13 81 AAGATGAAACTACGATTGACACTAAAGATGGTGATCTGCCCTTTTTTAATGTCTCTTTGT 144 0 

Qy 145 0 TAGACTGGATAAATGTTCAAGATAGACCTAATGATGTGGAATCTTTGGTCAGGAAGTGCT 15 0 9 

Db 1441 TAGACTGGATAAATGTTCAAGATAGACCTAATGATGTGGAATCTTTGGTCAGGAAGTGCT 15 0 0 

Qy 1510 TTGATTCTATGAGCAGGCTTGATCCAAGGATTATTCGACCATTTATAGCAGAATGCCGTC 1569 

Db 1501 TTGATTCTATGAGCAGGCTTGATCCAAGGATTATTCGACCATTTATAGCAGAATGCCGTC 1560 

Qy 1570 AAACTATTGCCAAACTTGATAATCAGAATATGAAAGCCATTAAAGGACTTGAAGATCGGC 1629 

Db 1561 AAAC T AT T GC CAAAC T T GAT AAT CAGAAT AT GAAAGCCAT T AAAGGAC T T GAAGAT CGGC 1620 

Qy 1630 TCTACGCCCTGGACCAGATGATTGCTAGCTGTGGCCGACTGGTGAATGAACAGAAAGAGC 1689 

Db 1621 TCTACGCCCTGGACCAGATGATTGCTAGCTGTGGCCGACTGGTGAATGAACAGAAAGAGC 168 0 

Qy 1690 T T GC T CAGGGAT T T T T AGC T AAT CAGAAGAGAGC T GAAAAC T T AAAGGAT GCATCTGTAT 1749 

Db 1681 TTGCTCAGGGATTTTTAGCTAATCAGAAGAGAGCTGAAAACTTAAAGGATGCATCTGTAT 1740 

Qy 1750 TACCTGATTTATGCCTGAGTCACGCAAATCAGTTGATGATTATGTTGCAAAATCATAGAA 1809 

Db 1741 TACCTGATTTATGCCTGAGTCACGCAAATCAGTTGATGATTATGTTGCAAAATCATAGAA 1800 

Qy 1810 AACTGTTAGATATTAAGCAGAAGTGTACCACTGCCAAACAAGAACTAGCAAATAACCTAC 1869 

Db 1801 AACTGTTAGATATTAAGCAGAAGTGTACCACTGCCAAACAAGAACTAGCAAATAACCTAC 18 60 

Qy 187 T T GC T T T GT AAT GC T T CAT GC T GAT CAAGAT GGAGAGAAG T 192 9 

Db 1861 ATGTCAGACTGAAGTGGTGTTGCTTTGTAATGCTTCATGCTGATCAAGATGGAGAGAAGT 1920 

Qy 1930 TACAAGCTTTGCTCCGCCTCGTAATAGAGCTGTTAGAAAGAGTCAAAATTGTTGAAGCTC 1989 
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:TTTGCTCCGCCTCGTAATAGAGCTGTTAGAAAGAGTCAAAATTGTTGAAGM Z I 

:agttcctcagatgtactgcttagctgttgttgaggttgtaagaagaaaaatgt S 

TTAGTACAGTTCCTCAGATGTACTGCTTAGCTGTTGTTGAGGTTGTAAGAAGAAAAATGT 2 
TCATAAAACACTACAGGGAGTGGGCTGGTGCTTTAGTCAAAGATGGAAAGAGATTATATG 2 
TCATAAAACACTACAGGGAGTGGGCTGGTGCTTTAGTCAAAGATGGAAAGAGATTATATG 2 



2230 
2221 



tgtttagggga :-a:t : /rGG/oooorroorrrrGiAoroAAAAGGGTGGAA 

ATCGTCTGTITAGGGGACTGGACTCCTGGCCCCCTTCCTTTTGTACTCAAAAGCCTCGAA 

AGTTTGAC TGTGAAC T T CCAGAT AT T T CAT T AAAAGAT T TACAGT T T C T GCAAT CATTTT 2 

TGTGAACTTCCAGATATTTCATTAAAAGATTTACAGTTTCTGCAATCATTTT 5 

'rGAAGTTCAGCCATTCCTCAGGGTTCCCTTACTTTGTGACTTTGAACCTCTAC 2 

-GAAGTTCAGCCATTCCTCAGGGTTCCCTTACTTTGTGACTTTGAACCTCTAC 2 

'GTACTTGCTCTACATAATTTGGTAAAAGCAGCACAAAGTTTGGATGAAATGT 2 



A> AG 'A' 



2470 
2461 
2530 
2521 
2 5 90 
2581 



2710 
2701 



■AAAAATi 
■AAAAATi 



GTACTTGCTCTACATAATTTGGTAAAAGCAGCACAAAGTTTGGATGAAATGT 2 



AAAAAGGGAATCCTTTGGGAAATTATTTAGGAAGTCTTTTTTAAGAA 
AAAAAGGGAATCCTTTGGGAAATTATTTAGGAAGTCTTTTTTAAGAA 



CACAGACCATTACAGATCTACTGAGTGAACAAAAGGCATCTGTGAGCCAGACATCCCCAC 
CACAGACCATTACAGATCTACTGAGTGAACAAAAGGCATCTGTGAGTCAGACATCCCCAC 

AGTCTGCTTCTTCACCAAGGATGGAAAGTACAGCAGGAATTACAACTACTACCTCACCGA 2 

AGTCTGCTTCTTCACCAAGGATGGAAAGTACAGCAGGAATTACAACTACTACCTCACCGA 2 

GAACTCCTCCACCACTGACTGTTCAGGATCCCTTATGTCCTGCAGTTTGTCCCTTAGAAG 2 

GAACTCCTCCACCACTGACTGTTCAGGATCCCTTATGTCCTGCAGTTTGTCCCTTAGAAG 2 

AATTATCTCCAGATAGTATTGATGCACATACGTTTGATTTTGAAACTATTCCCCATCCAA 2 
AATTATCTCCAGATAGTATTGATGCACATACGTTTGATTTTGAAACTATTCCCCATCCAA 

ACATAGAA A ,A TATT A AA^TTT ITTA\-. ZZ "-AIT ATTAGCAGAAAGTCCTG 2 

» » » ' AAC AGAC TATT CACCAAGT T T C T T T AGAC T T GGAT T CAT T AGCAGAAAGT CC T G 2 

AATCAGATTTTATGICTGCTGTGAATGAGTTTGTAATAGAAGAAAATTTGTCGTCTCCTA 2 

AATCAGATTTTATGTCTGCTGTGAATGAGTTTGTAATAGAAGAAAATTTGTCGTCTCCTA 2 

'70 ATCCTATAAGTGATCCACAAAGCCCAGAAATGATGGTGGAATCACTTTATTCATCAGTTA 2 

'61 ATCCTATAAGTGAT A AAA' jCC C AGAAAT GAT GGT GGAAT CAC TTTATTCATCAGTTA 2 

!30 TCAATGCGATAGACAGTAGACGAATGCAGGATACAAATGTATGTGGTAAGGAGGATTTTG 2 

121 TCAATGCGATAGACAGTAGACGAATGCAGGATACAAATGTATGTGGTAAGGAGGATTTTG 2 

!90 GAGATCATACTTCTCTGAATGTCCAGTTGGAAAGATGTAGAGTTGTTGCCCAAGACTCTC 2 

!81 GAGATCATACTTCTCTGAATGTCCAGTTGGAAAGATGTAGAGTTGTTGCCCAAGACTCTC 2 
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2 } 5 : ATT A -TATA : AAA : : AT TA_A - AAA -A : : T T T rCCAC T T T AGAACAT T T GT ACAAAAA - 3009 
ACC T T T GCCAC T T T AGAACAT T T GT AC AAAAAG 3 0 0 0 
AATTCATTAAAATGTACAGCAGTAGAAATAAGAAACATTATTG 3 q g 9 

0t aac ag t gt gac t t c t caaat t cat t aaaat gt acagcagt agaaat aagaaac at t at t g 3060 
taaaagaaaaacatcaaaaa'ia t2 t -i 3t2 9 
itaaaagaaaaacatcaaaaagaactactgt 3120 
iacggactaataaaggaaactgaagagaatg 318 9 
acggac t aat aaaggaaac t gaac 



aaaaagtaaaatgtt ;tctggaaataac 
3061 aaaaagtaaaatgttctctggaaataa( 
: t t t aaaaaat gaatat gaaggtaaac ' 
. t t t aaaaaat gaatat gaaggtaaac 
aaac aaaat t aaaaaat t gaagggai 



3 3 61 
3/130 



3490 
3481 



3610 
3601 



3730 
3721 
3790 
3781 
3850 
3841 



-ttag: 



;aatttgctttggttaaacatgaaaaagaagctgtaatctgcctg( 
aagataatgaatttgctttggttaaacatgaaaaagaagctgtaatctgcctgcagaatg 
aaaaggatcagaagttgttagagatggaaaatataatgcactctcaaaattgtgaaatta 
aaaaggatcagaagttgttagagatggaaaatataatgcactctcaaaattgtgaaatta 
aagaactgaagcagtcacgagaaatagtgttagaagacttaaaaaagctccatgttgaaa 
aagaactgaagcagtcacgagaaatagtgttagaagacttaaaaaagctccatgttgaaa 
atgatgagaagttacagttattgagggcagaacttcagtccttggagcaaagtcatctaa 
atgatgagaagttacagttattgagggcagaacttcagtccttggagcaaagtcatctaa 
aggaat t agaggacacac t tcaggt taggcacatacaagagt t tgagaaggt tatgacag 
aggaat tagaggacacact tcaggt taggcacatacaagagt t tgagaaggt tatgacag 
ac c ac agag t t t c t t t ggaggaat t aaaaaaggaaaat caacaaat aat t aat caaat ac 
accacagagt t tc t t tggaggaat taaaaaaggaaaaccaacaaat aat t aatcaaat ac 
aagaat c t cat gc t gaaat t at ccaggaaaaagaaaaacagt t ac aggaat taaaac t c a 
aagaat ctcatgct gaaat t at ccaggaaaaagaaaaacagt t acaggaat t aaaac t c a 
aggtttctgattt aag ttagaggtt gaac t t gc g t t gaaggaag 

aggtttctgatttgtcagacacgagatgcaagttagaggttgaacttgcgttgaaggaag 
cagaaac t gat gaaat aaaaat t t tgc tggaagaaagcagagcccagcagaaggagacc t 
cagaaac t gat gaaat aaaaat t t t gc t ggaagaaagcagagcccagcagaaggagacc t 
t gaaat ctcttctt gaacaagagacagaaaat t tgagaac agaaat tagtaaac t caacc 
t gaaat ctcttctt gaacaagagacagaaaat t t gagaacagaaat tagtaaac t caacc 
aaaagat t caggat aat aat g aaaat tat caggt gggc t t agcagagc t aagaac tttaa 
aaaagattcaggataataatgaaaattatcaggtgggcttagcagagctaagaactttaa 
t gacaat t gaaaaagat cagcgtat t tccgagt taat tagtagacatgaagaagaat c t a 
tgacaattgaaaaagatcagtgtatttccgagttaattagtagacatgaagaagaatcta 



3249 
324' 



3369 
3360 
3429 



3549 
3540 
3609 
3600 
3669 
3660 
3729 
3720 
3789 
3780 



3909 
3900 
3969 
3960 
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Qy 3970 AT AT AC T T AAAGC T GAAT T AAACAAAGTAACATCT T TGCATAACCAAGCAT T T G AAAT AG 4029 

Db 3961 ATATACTTAAAGCTGAATTAAACAAA -T-_- - Z ZZZ - -Z-S- -_- - -TTT ^AAATA^ 4"Z" 

Qy 4030 AAAAAAA TAAAAGAA AAATAATTGAACTGCAGAGTAAATTGGATTCAGAATTGAGTG 408 9 

Db 4021 AAAAAAA TAAAAGAA AAAT AAT T GAAC T GCAGAGT AAAT T GGAT T C AGAAT T GAG T G 4080 

Qy 4090 CTCTTGAAAGACAAAAAGATGAAAAAATTACCCAACAAGAAGAGAAATACGAAGCTATTA 4149 

Db 4081 C T C T T GAAAGACAAAAAGATGAAAAAAT TACCCAACAAGAAGAGAAAT ACGAAGC T AT T A 4140 

Qy 4150 T C C AGAAC C T T GAGAAAGAC AGAC AAAAAT T GGT CAGC AGCC AGGAGC AAGAC AGAGAAC 4209 

Db 4141 : AGACAAAAATTGGTCAGCAGCCAGGAGCAAGACAGAGAAC 42 0 0 

Qy 4210 AGTTAATTCAGAAGCTTAATTGTGAAAAAGATGAAGCTATTCAGACTGCCCTAAAAGAAT 4269 

Db 4201 AGTTAATTCAGAAGCTTAATTGTGAAAAAGATGAAGCTATTCAGACTGCCCTAAAAGAAT 4260 

Qy 4270 TTAAATTGGAGAGAGAAGTTGTTGAGAAAGAGTTATTAGAAAAAGTTAAACATCTTGAGA 4329 

Db 4261 TTAAATTGGAGAGAGAAGTTGTTGAGAAAGAGTTATTAGAAAAAGTTAAACATCTTGAGA 4320 

Qy 4330 ATCAAATAGCAAAAAGTCCTGCCATTGACTCTACCAGAGGAGATTCTTCAAGCTTAGTTG 4389 

Db 4321 ATCAAATAGCAAAAAGTCCTGCCATTGACTCTACCAGAGGAGATTCTTCAAGCTTAGTTG 4380 

Qy 43 90 CTGAACTTCAAGAAAAGCTTCAGGAAGAAAAAGCTAAGTTTCTAGAACAACTTGAAGAGC 444 9 

Db 4381 C T GAAC T T C AAGAAAAGC T T C AGGAAGAAAAAGC T AAGT T T C T AGAAC AACT T GAAGAG C 444 0 

Qy 445 0 AAGAAAAAAGAAAGAATGAAGAAATGCAAAATGTTCGAACATCTTTGATTGCGGAACAAC 45 0 9 

Db 4441 AAGAAAAAAGAAAGAATGAAGAAATGCAAAATGTTCGAACATCTTTGATTGCGGAACAAC 45 0 0 

Qy 4510 AGAC C AAT T T T AAC AC T G T T T T AAC AAGAGAGAAAAT GAGAAAAGAAAAC AT AAT AAAT G 4569 

Db 4501 AGACCAAT T T TAACAC TGT T T TAACAAGAGAGAAAATGAGAAAAGAAAACAT AAT AAAT G 4560 

Qy 4570 AT C T T AGT GATAAGT TGAAAAGTACAATGCAGCAACAAGAACGGGATAAAGAT T TGAT AG 4629 

Db 4561 ATCTTAGT GAT AAG T T GAAAAG T ACAAT GCAGCAACAAGAACGGGAT AAAGAT T T GAT AG 4620 

Qy 4630 AGT C AC T T T C T GAAGAT CGAGC TCGTTTGCTT GAGGAAAAGAAAAAGC T T GAAGAAGAAG 4689 

Db 4621 AGTCACTTTCT GAAGAT CGAGC TCGTTTGCTT GAGGAAAAGAAAAAGC T T GAAGAAGAAG 4680 

Qy 4690 TCAGT AAGT TGCGC AGT AGC AGT TT TGT TCCTTCACCATATGTAGCTACAGCCCCAGAAC 4749 

Db 4' -1 T A,TAA,TT, , A ,TA , A -Till ~II IT' 'ACCATATGTAGCTACAGCCCCAGAAC 4740 

Qy 4 7 T GAAC TCCCAGGT GAAT C AGAT AGAT CCGC TGT GGAAAC AG 480 9 

Db 4741 TTTATGGAG TTGT a AA T CCAGGTGAATCAGATAGST ? TGTGGAAACAG 4800 

Db 4801 CAGATGAAGGAAGAGTGGATTCAGCAATGGAGACAAGCATGATGTCTGTACAAGAAAATA 4 8 60 

Qy 4870 TTCATATGTTGTCTGAAGAAAAACAGCGGATAATGCTGTTAGAACGAACATTGCAATTGA 4 92 9 

Db 4861 TT ATAT-TT-T Z -Z-S- -- - - TGCTGTTAGAACGAACATTGCAATTGA 4920 

Qy 4930 AAGAAGAAGAAAATAAACGGTTAAATCAAAGACTGATGTCTCAGAGCATGTCT TCAGT AT 4989 
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5152 
5230 
5212 
5290 
5272 
5350 
5332 
VI I ( 
5392 

viv 

5452 
5530 
5512 
5590 
5572 



AAGAAGAAGAAAATAAACGGTTAAATCAAAGACTGATGTCTCAGAGCATGTCTTCAGTAT 
AAGGCATTCTGAAAAGATAGCTATTAGAGATTTTCAGGTGGGAGATTTGGTACTCA 
ATTCT GAAAAGAT AGC TAT T AGAGAT T T T CAGGT GGGAGAT 1 
ACGAACGCCATGACAATTATGTGTTATTTACTGTTAGTCCTi 

CGCCATGACAATTATGTGTTATTTACTGTTAGTCCTAC 
TCTGTA TG TGGATCTCAAACCAGGTGAGGGTGC 

TCTCTACCTGCCCTGGATCTCAAACCA ( 

TGGGTACTTGGAAAAGTAATGGAAAAAGAATACTGTt 
ATCTAGAAGACCCTGGGTACTCGGAAAAGTAATGGAAAAAGAATACTGTCAAGCCAAAA 
-GGCACAAAACAGAT T TAAAGT TCCT T TGGGGACAAAGT T T TACAGAGTGAAAGCCGT AT 
AGGCACAAAACAGATTTAAAGTTCCTTTGGGGACAAAGTTTTACAGAGTGAAAGCCGTAT 
C AT GGAAT AAGAAAG T AT AAC T T AT GGACAAAAT T AAT ACAT T C T AT GAC AT TTTTTTCT 
CATGGAATAAGAAAGTATAACTTATGGACAAAATTAATACATTCTATGACATTTTTTTCT 



, lATT'l'i. .'I' '■ 'TG' A'. VI 



57 52 
5830 
5812 
5890 
5872 
5950 



-CTCATTCATCACTCCAAAAACAGCAGGCCATCTTTTTATGCAAA 540 9 

5391 
54 f, 9 



GATTTGTCCTGCAGTGCTCATTCATCACTCCAAAAACAGCAGGCCATCTTTTTATGCAAA 
AGTCAGCGTGACAATATACTTCACTGGTGTACATCGTTTACTTTTTAACTGGCTTCATTT 
AGTCAGCGTGACAATATACTTCACTGGTGTACATCGTTTACTTTTTAACTGGCTTCATTT 
TAGGAATAATAAATTCATCAGAATCCTTGGCTGAATTAAAATGGTTTTTGTTTTTTGGTT 
TAGGAATAATAAATTCATCAGAATCCTTGGCTGAATTAAAATGGTTTTTGTTTTTTGGTT 



CATACCTTGTGCATTGTGGCTTATGATGAGCCATATTAATTGCCTGTTAAATATACACTA 
CATACCTTGTGCATTGTGGCTTATGATGAGCCATATTAATTGCCTGTTAAATATACACTA 
GCTTGAACTTAGATGTTAAATGTTATTATTAC ;AGCATTTGTCCTTTTGTGAAATCAGTA 
GCTTGAACTTAGATGTTAAATGTTATTATTACCAGCATTTGTCCTTTTGTGAAATCAGTA 
TCAGAATACTTGCACTCTTTAACACATTCTTTATAAAATGTATAAATTATTCAGAACTAT 



5-32 

5710 

5 5 92 T'5AGAATACTTGCACTCTTTAACACATTCTTTATAAAATGTATAAATTATTCAGAACTAT 5 75 



TTAAAATAAAGAGGAGTGTTATTGCATGCTGATAATCATTTTGAGTTTGCCTCAGTAGAT 
TTAAAATAAAGAGGAGTGTTATTGCATGCTGATAATCATTTTGAGTTTGCCTCAGTAGAT 
TTTCAGTTTTTTTAAATGCCCTTTGATGTTTCAAAAAAAAAAAGGA 
ACTAAAGCAAATTGTTTCAGTTTTTTTAAATGCCCTTTGATGTTTCAAAAAAAAAAAGGA 
AC T GT AAT T T GAT T GAC T GAT T T T AAGAT CAGCCAT AAGT AAT CAGCAAT CT T CAAAAGC 
ACTGTAATTTGATTGACTGATTTTAAGATCAGCCATAAGTAATCAGCAATCTTCAAAAGC 
AOIIICAGTGGATTGGTCATCTGGGTTCTAAAGGGAAGAGTCTGTGCTACTAACCATTTC 



5 1 ?' 0 
5169 
5151 
5229 
521 1 
5289 
5271 
5349 



552'! 
5511 
5589 
5571 
5649 



57" 9 
5691 
5769 



5 = 29 
5811 



5949 
5931 
6009 
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Db 5932 AC1TTCAGTGGATTGGTCATCTGGGTTCTAAAGGGAAGAGTCTGTGCTACTAACCATTTC 5991 

Qy 6010 AAATGCAGACI lAAAl CITCCCAACATCTTTATGACTCTAGAATAATCATATTGATGAAA 6069 

Db 5992 AAAT GC AGAC T C AAAC C T T C C C AAC AT C T T T AT GAC T C T AGAAT AAT CAT AT T GAT GAAA 6051 

Qy 6070 T C G T AAT TCATGGTT GAG T T T CAGAACAAAAGAT AT T CAT T GCACAT T AACCAT T T AGAG 6129 

Db 6052 TCGTAATTCATGGTTGAGTTTCAGAACAAAAGATATTCATTGCACATTAACCATTTAGAG 6111 

Qy 6130 GTCATTTAAAIAACAAAAIATTGTATTGTAAAAGAACTGTACAATTTTAAAACAAIAAAG 6189 

Db 6112 GTCATTTAAATAACAAAAIATTGTATTGTAAAAGAACTGTACAATTTTAAAACAAIAAAG 6171 

Qy 6190 ATTTGAACCTGTAAATGTGTGTGCCTTTTAAAGAAGGATACATTTTTAATATATTTGAGT 624 9 

Db 6172 ATTTGAACCTGTAAATGTGTGTGCCTTTTAAAGAAGGATACATTTTTAATATATTTGAGT 6231 

Qy 6250 GATTGCTGGGAAGTGTGAAAATATTGTTATGTATCATATCAAAGAGAAACATGTTTATTA 6309 

Db 6232 GATTGCTGGGAAGTGTGAAAATATTGTTATGTATCATATCAAAGAGAAACATGTTTATTA 6291 

Qy 6310 CAAAAATGTTCTTTAACTATATACTATGTAACAGGGTAAACAGTGTTATGTAGAATAGAA 63 6 9 

Db 6292 CAAAAATGTTCTTTAACTATATACTATGTAACAGGGTAAACAGTGTTATGTAGAATAGAA 6351 

Qy 6370 TTGTGTAAACTAGATCTTTAGAGAAGTTGCCATTGAGCAAAGTTATTTAAATGAGTTAGT 6429 

Db 6352 TTGTGTAAACTAGATCTTTAGAGAAGTTGCCATTGAGCAAAGTTATTTAAATGAGTTAGT 6411 

Qy 6430 TGAGTTGGATGAGAATTGTTTGAGGTTTGTTGCTAGAGAACAATAATAAAATAATTCTTT 6489 

Db 6412 TGAGTTGGATGAGAATTGTTTGAGGTTTGTTGCTAGAGAACAATAATAAAATAATTCTTT 6471 

Qy 64 90 TTCAGAAAATATTTAATTTCTTCATAAAAATAAGTTAAATATTTTTTTAAATATGTATAT 654 9 

Db 6472 TTCAGAAAATATTTAATTTCTTCATAAAAATAAGTTAAATATTTTTTTAAATATGTATAT 6531 

Qy 6550 CTAATAGTACAAAATGGAATAAACATCATAGTGTATAGAAAACTGAATTTGACAAGTTAA 6609 

Db 6532 CTAATAGTACAAAATGGAATAAACATCATAGTGTATAGAAAACTGAATTTGACAAGTTAA 65 91 

Qy 6610 TGAATAAATGAACAAATGATTTC 6632 

Db 6592 TGAATAAATGAACAAATGATTTC 6614 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
■ JOURNAL 



AUTHORS 
TITLE 



D86958 

Human male myeloblast 
D86958 

D86958 GI:1503989 
KIAA0203 protein. 
Homo sapiens (human) 



6614 bp mRNA 

mRNA for KIAA0203 protein 



linear PRI 22-AUG-1996 

complete cols . 



Eukaryotae; mit 1 eukaryotes; Metazoa; Chordata; 

Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 6614) 

Nomura, N. 

Direct Submission 

Submitted ( 02-AUG-l 996) Nobuo Nomura, Kazusa DNA Research 
Institute; 1532-3 Yanauchino, Kisarazu, Chiba 292, Japan 
(E-mail: cdnainfo@kazusa.or.jp, Tel : 0483-52-3 930 , Fax : 0483-52-3 93 1 : 



(1: as« 



> 661^ 



Nagase,T., Seki,N., Ishikawa,K 

Prediction of the coding sequences < 
The coding sequences of 80 new qene: 



Ohara,0. 



and Nomura, N. 
lentified human genes, 
i 0201 - KIAA 0280) 
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JOURNAL Unpublished (1996) 
FEATURES Location/Qualifiers 
source 1..6614 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9f.;0f.;" 

/chromosome="8" 

/sex="male" 

/cell_line="KG-l" 

/oell_ty;: e = "myeloblast" 

rene=" AA 
CDS 516. .5291 

/gene="KIAA0203" 

/note="The KIAA0203 protein has similarity to mouse CC1." 

/citation=[2] 

/codon_start=l 

/product="KIAA0203 protein" 

l\ i tein_i 1=" ' " 

/db_xref ="GI : 1503 99"b"" 

/trans lation="MKLYVFLVNTGTTLTFDTELTVQTVADLKHAIQSKYKIAIQHQV 
LWNGGECMAADRRVCTYSAGTDTNPIFLFNKEMILCDRPPAIPKTTFSTENDMEIKV 
EESLMMPAVFHTVASRTQLALEMYEVAKKLCSFCEGLVHDEHLQHQGWAAIMANLEDC 
SNSYQKLLFKFESIYSNYLQSIEDIKLKLTHLGTAVSVMAKIPLLECLTRHSYRECLG 
RLDSLPEHEDSEKAETKRSTELVLSPDMPRTTNESLLTSFPKSVEHVSPDTADAESGK 
EIRESCQSTVHQQDETTIDTKDGDLPFFNVSLLDWINVQDRPNDVESLVRKCFDSMSR 
LDPRI IRPFIAECRQTIAKLDNQNMKAIKGLEDRLYALDQMIASCGRLVNEQKELAQG 
FLANQKRAENLKDASVLPDLCLSHANQLMIMLQNHRKLLDIKQKCTTAKQELANNLHV 
RLKWCCFVMLHADQDGEKLQALLRLVIELLERVKIVEALSTVPQMYCLAWEWRRKM 
FIKHYREWAGALVKDGKRLYEAEKSKRESFGKLFRKSFLRNRLFRGLDSWPPSFCTQK 
PRKFDCELPDISLKDLQFLQSFCPSEVQPFLRVPLLCDFEPLHQHVLALHNLVKAAQS 
LDEMSQTITDLLSEQKASVSQTSPQSASSPRMESTAGITTTTSPRTPPPLTVQDPLCP 
AVCPLEELSPDSIDAHTFDFETIPHPNIEQTIHQVSLDLDSLAESPESDFMSAVNEFV 
IEENLSSPNPISDPQSPEMMVESLYSSVINAIDSRRMQDTNVCGKEDFGDHTSLNVQL 
ERCRWAQDSHFSIQTIKEDLCHFRTFVQKEQCDFSNSLKCTAVEIRNIIEKVKCSLE 
ITLKEKHQKELLSLKNEYEGKLDGLIKETEENENKIKKLKGELVCLEEVLQNKDNE FA 
LVKHEKEAVICLQNEKDQKLLEMENIMHSQNCEIKELKQSREIVLEDLKKLHVENDEK 
LQLLRAELQSLEQSHLKELEDTLQVRHIQEFEKVMTDHRVSLEELKKENQQIINQIQE 
SHAEI IQEKEKQLQELKLKVSDLSDTRCKLEVELALKEAETDEIKILLEESRAQQKET 
LKSLLEQETENLRTEISKLNQKIQDNNENYQVGLAELRTLMTIEKDQCISELISRHEE 
ESNILKAELNKVTSLHNQAFEIEKNLKEQIIELQSKLDSELSALERQKDEKITQQEEK 
YEAIIQNLEKDRQKLVSSQEQDREQLIQKLNCEKDEAIQTALKEFKLEREWEKELLE 
KVKHLENQIAKSPAIDSTRGDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNV 
RTSLIAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARL 
LEEKKKLEEEVSKLRSSSFVPSPYVATAPELYGACAPELPGESDRSAVETADEGRVDS 
AMETSMMSVQENIHMLSEEKQRIMLLERTLQLKEEENKRLNQRLMSQSMSSVSSRHSE 
KIAIRDFQVGDLVLI ILDERHDNYVLFTVSPTLYFLHSESLPALDLKPASGASRRPWV 
LGKVMEKEYCQAKKAQNRFKVPLGTKFYRVKAVSWNKKV" 
3.;U_TP 5292.. 6614 

ORIGIN 

1 aacaaaccaa gccgcggcgg 
61 ccccoagtcg cctgggcccc 
121 ' 
181 
241 ( 

301 gggtcgctgg ctctgctgot 
3 61 taaccagtaa tgccattcag 
421 ctacttttta agaaaagtgg 
431 tgaggtataa gctcacagaa 
541 acactggaac tactctaaca 
601 agcatgccat tcaaagcaaa 
6 61 gaggagaat ~\ catggctgca 

781 aaactacctt ttcgacagaa 



tgtccgcggc cctgccgagc 
tcggctctga caggccgcgg 
gctcgcgtcc ttgtccgccc 
cgggcgcccg ggacgccggc 
ccgctgccga ggactaggcg 
gccgccggcg aaggaggagg 
ttgccaatct caagcaaagc 
tagtcctttt cacagtgcct 
ttcagataaa tcatcatgaa 
tttgacactg aacttacagt 
tacaagattg ctattcaaca 
gatcgaagag tgtgtaccta 
aaagaaatga tcttatgtga 
aatgacatgg aaataaaagt 



cctcggcgtt gcctcagaat 
ccttctgtcc cccggcccca 
ggacccctcc ctgcctccta 
ggttgtgtcg gcttagcggt 
gcggcggaag atggtgccgg 
cgttgccggt tttctgagtt 
aaacataagc cagttttaat 
gacgtaactg tatcagaggg 
gttatatgta tttctggtta 
gcaaactgtg gcagacctta 
ccaggtgctg gtggtcaatg 
cagtgctggg acggatacaa 
tcgtccacct gctattccta 
tgaagaatct cttatgatgc 
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901 ccaagaaact ttgttctttt tgtgaaggtc ttgtacatga tgaacatctt caacaccaag 

961 gctgggctgc aatcatggcc aacctggagg actgttcaaa ttcataccaa aagctacttt 

1081 ttactcattt aggaactgca gtttcagtaa tggccaagat tccactgttg gagtgcctaa 

1141 ccagacatag ttaoagagaa tgtttgggaa gactggattc tttacctgaa catgaagact 

1201 1 i -iii ii i t- - - uaa agatccactg aactggtgct ctctcctgat atgoctagaa 

12 61 oaaotaacga atotttgtta acctcatttc ccaagtcagt ggaacatgtg tccccagata 
jaaagtggc aaagaaatta gggaatcttg tcaaagtact gttcatcagc 

1381 aagatgaaae tacgattgac aotaaagatg gtgatctgcc cttttttaat gtctctttgt 

1441 tagactggat aaatgttoaa gatagaccta atgatgtgga atctttggtc aggaagtgct 



1501 ttgattctat gagcaggctt gatccaagga ttattcgacc atttatagca gaatgccgtc 

1561 aaactattgo caaacttgat aatoagaata tgaaagccat taaaggaott gaagatcggc 

1621 tctacgccct ggaccagatg attgotagot gtggc;gact ggtgaatgaa cagaaagagc 

1681 ttgct iiii it' ttagot aat a a ragct jaaaa cttaaaggat goatotgtat 

1741 tacctgattt atgcctgagt cacgcaaatc agttgatgat tatgttgcaa aatcatagaa 

1801 aactgttaga tattaagoag aagtgtacca ctgccaaaca agaactagca aataacctac 

18 61 atgtcagact gaagtggtgt tgctttgtaa tgcttcatgc tgatcaagat ggagagaagt 

1921 tacaag ttt it i taatagagc tgttagaaag agtcaaaatt gttgaagctc 

1981 ttagtacagt tcctcagatg tactgcttag ctgttgttga ggttgtaaga agaaaaatgt 

2 041 tcataaaaca ctacagggag tgggctggtg ctttagtcaa agatggaaag agattatatg 

2101 aagcagaaaa atoaaaaagg gaatcctttg ggaaattatt taggaagtct tttttaagaa 

2161 atcgtctgtt taggggactg gactcctggc ccccttcctt ttgtactcaa aagcctcgaa 

2221 agtttgactg tgaacttcca gatatttcat taaaagattt acagtttctg caatcatttt 

2281 gtccttcgga agttcagcca ttcctcaggg ttcccttact ttgtgacttt gaacctctac 

2341 accagcatgt acttgctcta cataatttgg taaaagcagc acaaagtttg gatgaaatgt 

2401 cacagaccat tacagatcta ctgagtgaac aaaaggcatc tgtgagtcag acatccccac 

24 61 agtctgcttc ttcaccaagg atggaaagta cagcaggaat tacaactact acctcaccga 

2521 gaactcctcc accactgact gttcaggatc ccttatgtcc tgcagtttgt cccttagaag 

2581 aattatctcc agatagtatt gatgcacata cgtttgattt tgaaactatt ccccatccaa 

2641 acatagaaca gactattcac caagtttctt tagacttgga ttcattagca gaaagtcctg 

2701 aatcagattt tatgtctgct gtgaatgagt ttgtaataga agaaaatttg tcgtctccta 

2761 atcctataag tgatccacaa agcccagaaa tgatggtgga atcactttat tcatcagtta 

2 821 tcaatgcgat agacagtaga cgaatgcagg atacaaatgt atgtggtaag gaggattttg 
2881 gagatcatac ttctctgaat gtccagttgg aaagatgtag agttgttgcc caagactctc 
2941 acttcagtat acaaaccatt aaggaagacc tttgcoactt tagaacattt gtacaaaaag 
3001 aacagtgtga cttctcaaat tcattaaaat gtacagcagt agaaataaga aacattattg 
3061 aaaaagtaaa atgttctctg gaaataacac taaaagaaaa acatcaaaaa gaactactgt 
3121 ctttaaaaaa tgaatatgaa ggtaaacttg acggactaat aaaggaaact gaagagaatg 
3181 aaaacaaaat taaaaaattg aagggagagt tagtatgcct tgaggaggtt ttacaaaata 
3241 aagataatga atttgctttg gttaaacatg aaaaagaagc tgtaatctgc ctgcagaatg 
3301 aaaaggatca gaagttgtta gagatggaaa atataatgca ctctcaaaat tgtgaaatta 
33 61 aagaactgaa gcagtcacga gaaatagtgt tagaagactt aaaaaagctc catgttgaaa 
3421 atgatgagaa gttacagtta ttgagggcag aacttcagtc cttggagcaa agtcatctaa 
3481 aggaattaga ggacacactt caggttaggc acatacaaga gtttgagaag gttatgacag 
3541 ig gaattaaaaa aggaaaacca acaaataatt aatcaaatac 
3601 aagaatctca tgctgaaatt atccaggaaa aagaaaaaca gttacaggaa ttaaaactca 

3 6 61 aggtttctga tttgtcagac acgagatgca agttagaggt tgaacttgcg ttgaaggaag 

3 721 cagaaactga tgaaataaaa attttgctgg aagaaagcag agcccagcag aaggagacct 
3781 tgaaatctct tcttgaacaa gagacagaaa a^t gagaac agaaattagt aaactcaacc 
3841 aaaagattca ggataataat gaaaattatc aggtgggctt aa 
3901 tgacaattga aaaagatcag tgtatttccg agttaattag tagacat aa taagaat ta 
3961 atatacttaa agctgaatta aacaaagtaa catctttgca taaccaagca tttgaaatag 
4021 aaaaaaacct aaaagaacaa ataattgaac tgcagagtaa attggattca gaattgagtg 
4081 ctcttgaaag acaaaaagat gaaaaaatta cccaacaaga agag; 

4141 tccagaacct tgagaaagac agacaaaaat tggtcagcag ccagqi i agagaa 
4201 aaaag atgaagctat tcagactgcc ctaaaagaat 

42 61 ttaaattgga gagagaagtt gttgagaaag agttattaga aaaagttaaa catcttgaga 

4321 atcaaatagc aaaaagtcct gccattgact ctaccagagg agattcttca agcttagttg 

4381 ctgaacttoa agaaaagctt caggaagaaa aagctaagtt tctagaacaa ctt raagag 
4441 I gaaatgcaaa atgttcgaac atctttgatt gcggaacaac 

4501 agaccaattt taacactgtt ttaacaagag agaaaatgag aaaagaaaac ataataaatg 

45 61 atcttagtga taagttgaaa agtacaatgc agcaacaaga acgggataaa gatttgatag 

4 621 agtcactttc tgaagatcga gctcgtttgc ttgaggaaaa gaaaaagctt gaagaagaag 
4 681 tcagtaagtt gcgcagtagc agttttgttc cttcaccata tgtagctaca gccccagaac 
4 741 tttatggagc ttgtgcacct gaactcccag gtgaatcaga tagatccgct gtggaaacag 
4801 cagatgaagg aagagtggat tcagcaatgg agacaagcat gatgtctgta caagaaaata 
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4 921 aagaagaaga aaataaacgg ttaaatcaaa 
4981 cttcaaggca ttctgaaaag atagctatta 

5101 ttctacattc agagtctcta cctgccctgg 
5161 

5221 acaga- 

5401 gacaatatac ttcactggtg tacatcgttt 

54 61 taaattcate agaatccttg gctgaattaa 

5 5 21 

5581 tgeattgtgg ottatgatga jooa^a^aa 

5 641 t a g a t g 1 1 a a a t ■ j 1 1 a 1 1 a t 

5 7 61 agaggagtgt tattgoatgc tgataatcat 

5 821 aattgtttca gttttttts 

5881 tgattgactg attttaagat cagccataag 

5 941 ggattggtca tctgggttct aaagggaaga 

6001 ctcaaacctt cccaacatct ttatgactct 

60 61 atggttgagt ttcagaacaa aagatattca 

6121 ataacaaaat attgtattgt aaaagaactg 

6181 tgtaaatgtg tgtgcctttt aaagaaggat 

6241 gaagtgtgaa aatattgtta tgtatcatat 

63 01 tctttaacta tatactatgt aacagggtaa 

63 61 ctagatcttt agagaagttg ccattgagca 

6421 tgagaattgt ttgaggtttg ttgctagaga 

6481 tatttaattt cttcataaaa ataagttaaa 

6541 caaaatggaa taaacatcat agtgtataga 

6601 gaacaaatga tttc 



taatgctgtt agaacgaaca ttgcaattga 
gactgatgtc tcagagcatg tcttcagtat 
gagattttca ggtgggagat ttggtactca 
tgttatttac tgttagtcct actttatatt 
atctcaaacc agcttcaggt gcatctagaa 
aagaatactg tcaagccaaa aaggcacaaa 
tttacagagt gaaagccgta tcatggaata 
cattctatga catttttttc tgatttgtcc 
caggccatct ttttatgcaa aagtcagcgt 

aatggttttt gttttttggt tttttttttt 
ctacttcatt ttctcaaagg gcataccttg 
ttgcctgtta aatatacact agcttgaact 
tgtccttttg tgaaatcagt atcagaatac 
gtataaatta ttcagaacta tttaaaataa 
tttgagtttg cctcagtaga tactaaagoa 
atgtttcaaa aaaaaaaagg aactgtaatt 
taatcagcaa tcttcaaaag cactttcagt 
gtctgtgcta ctaaccattt caaatgcaga 
agaataatca tattgatgaa atcgtaattc 
ttgcacatta accatttaga ggtcatttaa 
tacaatttta aaacaataaa gatttgaacc 
acatttttaa tatatttgag tgattgctgg 
caaagagaaa catgtttatt acaaaaatgt 
acagtgttat gtagaataga attgtgtaaa 
aagttattta aatgagttag ttgagttgga 
acaataataa aataattctt tttcagaaaa 
tattttttta aatatgtata tctaatagta 
aaactgaatt tgacaagtta atgaataaat 




Blast 2 Sequences results 

BLAST 2 SEQUENCES RESULTS VERSION BLASTP 2.2.18 [Mar-02-2008] 

:t gap extension: 
I 3 Filter 

Masking color option 



w . r'BLOSUM62 ill 

Matrix 3 ssa 



x dropoff: I 



expect:^ 300 ° wordsizc: 



n , ,• • Standard 

View option 3 



Masking character option 

l Black 



Show CDS translation 



X for protein, n for nucleotide 



Sequence 1: gi|40788906|KIAA0203 [Homo sapiens] 
Length = 1593 (1 .. 1593) 

Sequence 2: gi|l 19607126|RBl-inducible coiled-coil 1, isoform CRA b [Homo sapiens] 
>gi|168272926|dbj|BAG10302.1| RBI -inducible coiled-coil protein 1 [synthetic construct] 
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Length = 1591 (1 .. 1591) 




N< > } K.Bitscore and expect value are calculated based on the size of the nr database. 



= 1591/1591 (100%), Gaps = 0/1591 (0%) 

MKLYVFLVNTGTTLTFDTELTVQTVADLKHAIQSKYKIAIQHQVLWNGGECMAADRRVC 62 
MKLYVFLVNTGTTLTFDTELTVQTVADLKHAIQSKYKIAIQHQVLWNGGECMAADRRVC 
MKLYVFLVNTGTTLTFDTELTVQTVADLKHAIQSKYKIAIQHQVLWNGGECMAADRRVC 60 



Query 63 
Sbjct 61 



TYSAGTDTNPIFLFNKEMILCDRPPAIPI 
TYSAGTDTNPIFLFNKEMILCDRPPAIP] 
TYSAGTDTNPIFLFNKEMILCDRPPAIPI 



1EIKVEESLMMPAVFHTVASRTQ 122 
IEIKVEESLMMPAVFHTVASRTQ 
1EIKVEESLMMPAVFHTVASRTQ 120 



LALEMYEVAKKLCSFCEGLVHDEHLQHQGWAAIMANLEDCSNSYQKLLFKFESI YSNYLQ 
LALEMYEVAKKLCSFCEGLVHDEHLQHQGWAAIMANLEDCSNSYQKLLFKFESI YSNYLQ 
LALEMYEVAKKLCSFCEGLVHDEHLQHQGWAAIMANLEDCSNSYQKLLFKFES I YSNYLQ 

SIEDIKLKLTHLGTAVSVMAKIPLLECLTRHSYRECLGRLDSLPEHEDSEKAETKRSTEL 
SIEDIKLKLTHLGTAVSVMAKIPLLECLTRHSYRECLGRLDSLPEHEDSEKAETKRSTEL 
SIEDIKLKLTHLGTAVSVMAKIPLLECLTRHSYRECLGRLDSLPEHEDSEKAETKRSTEL 

VL3PDMPRTTNE3LLTSFPK3VEHVSPDTADAESGKEIRESCQSTVHQQDETTIDTKDGD 
VLSPDMPRTTNESLLTSFPKSVEHVSPDTADAESGKEIRESCQSTVI 

VLSPDMPRTTNESLLTSFPKSVEHVSPDTADAESGKEIRESCQSTVHQQDETTIDTKDGD 

LPFFNVSLLDWINVQDRPNDVESLVRKCFDSMSRLDPRI IRPFIAECRQTIAKLDNQNMK 
LPFFNVSLLDWINVQDRPNDVESLVRKCFDSMSRLDPRI IRPFIAECRQTIAKLDNQNMK 
LPFFNVSLLDWINVQDRPNDVESLVRKCFDSMSRLDPRI IRPFIAECRQTIAKLDNQNMK 



AIKGLEDRLYALDQMIASCGRLVNEQKELAQC 
AIKGLEDRLYALDQMIASCGRLVNEQKELAQC 

ii lc -"'ii :^,-k:ttakqelannlhvri 

MI ML Z KbLF ; Y - T~ ' jELANNLHVRI 

iml - _ :k,ik ttakqelannlhvri 



iqkraenlkdasvlpdlclshanql 
iqkraenlkdasvl p dlc l s h anql 
iqkraenlkdasvlpdlclshanql 

;fvmlhadqdgeklqallrlviell 
;fvmlhadqdgeklqallrlviell 
;fvmlhadqdgeklqallrlviell 



ervkivealstvpqmyclawewrrkmfikhyrewagalvkdgkrlyeaekskresfgk 542 
ervkiveali mrkmfikhyrewagalvkdgkrlyeaekskresfgk 

ERVKIVEALSTVPQMYCLAWEWRRKMFIKHYREWAGALVKDGKRLYEAEKSKRESFGK 54 0 

LFRKSFLRNRLFRGLDSWPPSFCTQKPRKFDCELPDISLKDLQFLQSFCPSEVQPFLRVP 602 
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LLCDFEPLHQHVLALHNLVKAAQSLDEMSQTIi:,!! IA 
LLCDFEPLHQHVLALHNLVKAAQSLDEMSQTITDLLSEQKASVSQTSPQSASSPRME3TA 
LLCDFEPLHQHVLALHNLVKAAQSLDEMSQTITDLLSEQKASVSQTSPQSASSPRMESTA 



GITTTT PR.TPPPLT^ III PAVCPLEELSPDSIDAHTFDFETIPHPNIEQTIHQVSLL 
GITTTT3PRTPPPLTVQDPL3PAVCPLEELSPDSIDAHTFDFETIPHPNIEQTIHQVSLD 
GIT TFDFETIPHPNIEQTIHQVSLD 

LDSLAESPESDFMSAVNEFVIEENLSSPNPISDPQSPEMMVESLYSSVINAIDSRRMQDT 
DPQSPEMMVESLYSSVINAIDSRRMQDT 
JEFVIEENLSSPNPISDPQSPEMMVESLYSSVINAIDSRRMQDI 



:qtikedlchfrtfvqkeqcdfsnslkct 
:qtikedlchfrtfvqkeqcdfsnslkct 
:qtikedlchfrtfvqkeqcdfsnslkct 



841 AVEIRI 



Sbjct 
Sbjct 
Sbjct 



1201 
1263 



1321 
1383 



1503 
1501 



CLEEVLQNKDNEFALVKHEKEAVICLQNEKDQKLLEMENIMHSQNCEIKELKQSREIVLE 
CLEEVLQNKDNEFALVKHEKEAVI CLQNEKDQKLLEMEN IMHSQNCE IKELKQSRE I VLE 
CLEEVLQNKDNEFALVKHEKEAVICLQNEKDQKLLEMENIMHSQNCEIKELKQSREIVLE 

DLKKLHVENDEKLQLLRAELQSLEQSHLKELEDTLQVRHIQEFEKVMTDHRVSLEELKKE 
DLKKLHVENDEKLQLLRAELQSLEQSHLKELEDTLQVRHIQEFEKVMTDHRVSLEELKKE 
DLKKLHVENDEKLQLLRAELQSLEQSHLKELEDTLQVRHIQEFEKVMTDHRVSLEELKKE 

NQQI INQIQESHAEIIQEKEKQLQELKLKVSDLSDTRCKLEVELALKEAETDEIKILLEE 
NQQI INQIQESHAEI IQEKEKQLQELKLKVSDLSDTRCKLEVELALKEAETDEIKILLEE 
NQQI INQIQESHAEI IQEKEKQLQELKLKVSDLSDTRCKLEVELALKEAETDEIKILLEE 

SRAQQKETLKSLLEQETENLRTEISKLNQKIQDNNENYQVGLAELRTLMTIEKDQCISEL 
SRAQQKETLKSLLEQETENLRTEISKLNQKIQDNNENYQVGLAELRTLMTIEKDQCISEL 
SRAQQKETLKSLLEQETENLRTEISKLNQKIQDNNENYQVGLAELRTLMTIEKDQCISEL 
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FQVGDLVLI ILDERHDNYVLFTVSPTLYFLHSESLPALDLKPASGASRRPWVLGKVMEKE 
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Application/Control Number: 10/516,558 
Art Unit: 1642 

Query 1563 YCQAKKAQNRFKVPLGTKFYRVKAVSWNKKV 1593 

Y C QAKKAQ NR. F K V P L G T K F YRVKAVS WNKKV 
Sbjct 1561 YCQAKKAQNRFKVPLGTKFYRVKAVSWNKKV 1591 



